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Abstract. In this paper, we present a number of network-analysis al- 
gorithms in the external-memory model. We focus on methods for large 
naturally sparse graphs, that is, n-vertex graphs that have O(n) edges 
and are structured so that this sparsity property holds for any subgraph 
of such a graph. We give efficient external-memory algorithms for the 
following problems for such graphs: 

1. Finding an approximate d-degeneracy ordering. 

2. Finding a cycle of length exactly c. 

3. Enumerating all maximal cliques. 

Such problems are of interest, for example, in the analysis of social 
networks, where they are used to study network cohesion. 

1 Introduction 

Network analysis studies the structure of relationships between various entities, 
with those entities represented as vertices in a graph and their relationships 
represented as edges in that graph (e.g., see [TJ). For example, such structural 
analyses include link-analysis for Web graphs, centrality and cohesion measures 
in social networks, and network motifs in biological networks. In this paper, 
we are particularly interested in network analysis algorithms for finding various 
kinds of small subgraphs and graph partitions in large graphs that are likely to 
occur in practice. Of course, this begs the question of what kinds of graphs are 
likely to occur in practice. 

1.1 Naturally Sparse Graphs 

A network property addressing the concept of a "real world" graph that is 
gaining in prominence is the k-core number |25j . which is equivalent to a graph's 
width [IB], linkage [TH], k-inductivity [TS], and k- degeneracy [2121) . and is one 
less than its Erdos-Hajnal coloring number [T?]. A fc-core, G' , in a graph, G, is 
a maximal connected subgraph of G such that each vertex in G' has degree at 
least k. The fc-core number of a graph G is the maximum k such that G has a 
non-empty /c-core. We say that a graph G is naturally sparse if its fc-core number 
is 0(1). This terminology is motivated by the fact that almost every n-vertex 
graph with 0[n) edges has a bounded fc-core number, since Pittel et al. 23J show 
that a random graph with n vertices and an edges (in the Erdos-Renyi model) 
has fc-core number at most 2c + o(c) , with high probability. Riordan [24] and 



Fernholz and Ramachandran |15j have also studied fc-cores in random graphs. 
In addition, we also have the following: 

— Every s-vertex subgraph of a naturally sparse graph is naturally sparse, 
hence, has O(s) edges. 

— Any planar graph has fc-core number at most 5, hence, is naturally sparse. 

— Any graph with bounded arboricity is naturally sparse (e.g., see |10]h 

— Eppstein and Strash [T3] verify experimentally that real-world graphs in 
four different data repositories all have small k-core numbers relative to 
their sizes; hence, these real-world graphs give an empirical motivation for 
naturally sparse graphs. 

— Any network generated by the Barabasi- Albert [J preferential attachment 
process, with m € 0(1), or as in Kleinberg's small-world model [20], is 
naturally sparse. 

Of course, one can artificially define an n-vertex graph, G", with 0{n) edges that 
is not naturally sparse just by creating a clique of 0(n x / 2 ) vertices in an n-vertex 
graph, G, having 0(n) edges. We would argue, however, that such a graph G' 
would not arise "naturally." We are interested in algorithms for large, naturally 
sparse graphs. 

1.2 External-Memory Algorithms 

One well-recognized way of designing algorithms for processing large data sets 
is to formulate such algorithms in the external memory model (e.g., see the 
excellent survey by Vitter [37]). In this model, we have a single CPU with 
main memory capable of storing M items and that computer is connected to 
D external disks that are capable of storing a much larger amount of data. 
Initially, we assume the parallel disks are storing an input of size N. A single 
I/O between one of the external disks and main memory is defined as either 
reading a block of B consecutively stored items into memory or writing a block 
of the same size to a disk. Moreover, we assume that this can be done on all D 
disks in parallel if need be. 

Two fundamental primitives of the model are scanning and sorting. Scanning 
is the operation of streaming N items stored on D disks through main memory, 
with I/O complexity 



e.g., see Vitter [27] . 

Since this paper concerns graphs, we assume a problem instance is a graph 
G = (V, E), with n = \V\, m — \E\ and N = \G\ = m + n. If G is d-degenerate, 
that is, has fc-core number, d, then m < dn and N — 0(dn) — 0(n) for d = 0(1). 
We use d to denote the fc-core number of an input graph, G, and we use the term 
"d-degenerate" as a shorthand for "/c-core number equal to d." 




and sorting N items has I/O complexity 
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1.3 Previous Related Work 

Several researchers have studied algorithms for graphs with bounded fc-core 
numbers (e.g., see [113112117118] '). These methods are often based on the fact 
that the vertices in a graph with fc-core number, d, can be ordered by repeatedly 
removing a vertex of degree at most d, which gives rise to a numbering of the 
vertices, called a d-degeneracy ordering or Erdos-Hajnal sequence, such that each 
vertex has at most d edges to higher-numbered vertices. In the RAM model, this 
greedy algorithm takes 0(n) time (e.g., see [5]). Bauer et al. describe methods 
for generating such graphs and their d-degeneracy orderings at random. 

In the internal-memory RAM model, Eppstein et al. [12] show how to find all 
maximal cliques in a <i-degenerate graph in 0(d3 d / 3 n) time. Alon et al. [3] show 
that one can find a cycle of length exactly c, or show that one does not exist, in 
a d-degenerate graph in time 0(d 1_1 / fc m 2 ~ 1 /' c ), if c = 4fc — 2, time 0{dm 2 ~~ 1 t k ), 
if c = 4fc - 1 or 4fc, and time 0(d 1+1 ^ k m 2 ~ 1 / k ), if c = 4fc + 1. 

A closely related concept to a d-degeneracy ordering is a k-core decomposition 
of a graph, which is a labeling of each vertex v with the largest k such that 
v belongs to a fc-core. Such a labeling can also be produced by the simple 
linear-time greedy algorithm that removes a vertex of minimum degree with 
each iteration. Cheng et al. [5] describe recently an external-memory method for 
constructing a fc-core decomposition, but their method is unfortunately fatally 
flawecQ The challenge in producing a fc-core decomposition or d-degeneracy 
ordering in external memory is that the standard greedy method, which works 
so well in internal memory, can cause a large number of I/Os when implemented 
in external memory. Thus, new approaches are needed. 

1.4 Our Results 

In this paper, we present efficient external-memory network analysis algorithms 
for naturally sparse graphs (i.e., degenerate graphs with small degeneracy). First, 
we give a simple algorithm for computing a (2 + e)d-degeneracy ordering of a 
d-degenerate graph G = (V,E), without the need to know the value of d in 
advance. The I/O complexity of our algorithm is 0(sort(dn)). 

Second, we give an algorithm for determining whether a d-degenerate graph 
G = (V, E) contains a simple cycle of a fixed length c. This algorithm uses 
o(d 1±e ■ (fc • sort(m 2 ~i) + (4fc)! • scan{m 2 ~^)^j I/O complexity, where e is a 

constant depending on c € {4fc — 2, . . . , 4fc + 1}. 

Finally, we present an algorithm for listing all maximal cliques of an undi- 
rected (i-degenerate graph G = (V,E), with 0(3 s ^ 3 sort(dn)) I/O complexity, 
where S = (2 + e)d. 

One of the key insights to our second and third results is to show that, for 
the sake of designing efficient external-memory algorithms, using a (2 + e)d- 
degeneracy ordering is almost as good as a d-degeneracy ordering. In addition 
to this insight, there are a number of technical details that lead to our results, 
which we outline in the remainder of this manuscript. 

1 We contacted the authors and they confirmed that their method is indeed incorrect. 
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2 Approximating a d-Degeneracy Ordering 



Our method for constructing a (2 + e)d-degeneracy ordering for a d-degenerate 
graph, G — (V,E), is quite simple and is given below as Algorithm [I] Note 
that our algorithm does not take into account the value of d, but it assumes we 
are given a constant e > as part of the input. Also, note that this algorithm 
destroys G in the process. If one desires to maintain G for other purposes, then 
one should first create a backup copy of G. 



1: L«-0 

2: while G is nonempty do 

3: S <— ne/(2 + e) vertices of smallest degree in G 

4: L «— L\S // append S to the end of L 

5: remove S from G 

6: end while 

7: return L 

Algorithm 1: Approximate degeneracy ordering of vertices 



Lemma 1. If G is a d-degenerate graph, then Algorithm\lj computes a (2 + e)d- 
degeneracy ordering ofG. 

Proof. Observe that any d-degenerate graph with n vertices has at most 2n/c 
vertices of degree at least cd. Thus, G has at most 2n/(2 + e) vertices of degree 
at least (2 + e)d. This means that the ne/(2 + e) vertices of smallest degree in G 
each have degree at most (2 + e)d. Therefore, every element of set S created in 
line 3 has at most (2 + e)d neighbors in (the remaining graph) G. When we add S 
to L in line 4, we keep the property that every element of L has at most (2 + e)d 
neighbors in G that are placed behind it in L. Furthermore, note that, after we 
remove vertices in S (and their incident edges) from G in line 5, G is still at most 
d-degenerate (every subgraph of a d-degenerate graph is at most d-degenerate); 
hence, an inductive argument applies to the remainder of the algorithm. □ 

Note that, after riog( 2 + e )/2(^ n )l = O(lgn) iterations, we must have processed 
all of G and placed all its vertices on L, which is a (2 + e)d-degeneracy ordering 
for G and that this property holds even though the algorithm does not take the 
value of d into account. 

The following lemma is proved in the Appendix. 

Lemma 2. An iteration of the while loop (lines 3-5) of Algorithm [7] can be 
implemented in 0(sort(dn)) I/O's in the external-memory model, where n is 
the number of vertices in G at the beginning of the iteration. 

Thus, we have the following. 

Theorem 1. We can compute a (2 + e)d- degeneracy ordering of a d-degenerate 
graph, G, in 0(sort(dn)) I/O's in the external-memory model, without knowing 
the value of d in advance. 
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Proof. Since the number of vertices of G decreases by a factor of 2/ (2 + e) in each 
iteration, and each iteration uses 0(sort(dn)) I/O's, where n is the number of 
vertices in G at the beginning of the iteration (by Lemma [2]), the total number 
of I/O's, /(G), is bounded by 

1(G) =0^sort(dn) + sort((2/(2 + e))dn) + sort ((2/(2 + e)) 2 dn) + ■■■) 

= ( S0 ,t(dn)(l + ^- + (^-) 2 + ...)) 

= 0(sort(dn)). □ 

This theorem hints at the possibility of effectively using a (2 + e)<i-degeneracy 
ordering in place of a d-degeneracy ordering in external-memory algorithms for 
naturally sparse graphs. As we show in the remainder of this paper, achieving 
this goal is indeed possible, albeit with some additional alterations from previous 
internal-memory algorithms. 



3 Short Paths and Cycles 

In this section, we present external-memory algorithms for finding short cycles in 
directed or undirected graphs. Our approach is an external-memory adaptation 
of internal-memory algorithms by Alon et al. [3J. We begin with the definition 
and an example of a representative due to Monien |22j . A p-set is a set of size p. 

Definition 1 (representative). LetF be a collection ofp-sets. A sub- collection 
T C T is a g-representative for J-, if for every q-set B , there exists a set A 6 T 
such that A l~l B = if and only if there exists a set A £ T with this property. 

Every collection of p-sets T has a g-representative T of size at most ( P p 9 ) 
(from Bollobas [7 ). An optimal representative, however, seems difficult to find. 
Monien [22] gives a construction of representatives of size at most 0{^[ =1 p % ). 
It uses a p-ary tree of height < q with the following properties. 

— Each node is labeled with either a set A € J or a special symbol A. 

— If a node is labeled with a set A and its depth is less than q, it has exactly p 
children, edges to which are labeled with elements from A (one element per 
edge, every element of A is used to label exactly one edge). 

— If a node is labeled with A or has depth g, it has no children. 

— Let E(v) denote the set of all edge labels on the way from the vertex v to 
the root of the tree. Then, for every v: 

- if v is labeled with A, then A n E(v) = 

- if v is labeled with A, then there are no A £ T s.t. A n E(v) = 0. 

Monien shows that if a tree T fulfills the above conditions, defining T to be 
the set of all labels of the tree's nodes yields a g-representative for T . As an exam- 
ple, consider a collection of 2-sets, T = {{2, 4}, {1, 5}, {1, 6}, {1, 7}, {3, 6}, {3, 8}, 
{4, 7}, {4, 8}}. Fig. F] presents J 7 , a 3-representative of T in the tree form. 
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Fig. 1. Tree representation of T 



The main benefit of using representatives in the tree form stems from the fact 
that their sizes are bounded by a function of only p and q (i.e., maximum size of 
a representative does not depend on \J-\). It gives a way of storing paths of given 
length between two vertices of a graph in a space-efficient way (see Appendix 
for details). 

The algorithm for finding a cycle of given length has two stages. In the first 
stage, vertices of high degree are processed to determine if any of them belongs 
to a cycle. This is realized using algorithm cycleThrough from Lemma [5j Since 
there are not many vertices of high degree, this can be realized efficiently. 

In the second stage, we remove vertices of high degree from the graph. Then, 
we group all simple paths that are half the cycle length long by their endpoints 
and compute representatives for every such set (see Lemma [3]). For each pair of 
vertices (u, v), we determine (using findDis joint from Lemma |4| if there are 
two paths: p from utoii and p' from v to u, such that p and p' do not share any 
internal vertices. If this is the case, C — p U p' is a cycle of required length. 

The following representatives-related lemmas are proved in the Appendix. 

Lemma 3. We can compute a q-representative T for a collection of p-sets T , 
of size \f\ < ELiPS m 0((E£iV) ' scan(\F\)) I/O's. 

Lemma 4. For a collection of p-sets, T, and a collection of q-sets, Q, there is 
an external-memory method, f indDisjoint(J r , Q), that returns a pair of sets 
(A, B) (A £E JF , B € Q ) s.t. AD B = or returns e if there are no such pairs of 

sets. findDisjoint uses o((E?=i P i + E<=i 9*) ' scan(\T\ + \g\)) I/O's. 

Lemma 5. Let G = (V,E). A cycle of length exactly k that passes through 
arbitrary v £ V , if it exists, can be found by an external-memory algorithm 
cycleThrough(G, k, v) in 0{(k — 1)! • scan{rnj) I/O's, where m = \E\, via the 
use of representatives. 
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Before we present our result for naturally sparse graphs, we first give an 
external-memory method for general graphs. 

Theorem 2. Let G — (V, E) be a directed or an undirected graph. There is an 
external- memory algorithm that decides if G contains a cycle of length exactly 
c € {2k — 1, 2k}, and finds such cycle if it exists, that takes 0(k ■ sort(m 2 ~ ' * ) + 
(2k ~ 1)! • scan(m 2 -i)) I/O's. 

Proof. Algorithm [2] handles the case of general graphs (which are not necessarily 
naturally sparse), and cycles of length c = 2k (the case of c = 2k— 1 is analogous). 



1: A «— mi 

2: for all v - vertex of degree > A do 
3: C <- cycleThrough(G, 2k, v) 
4: if C 7^ e then 
5: return C 
6: end if 
7: end for 

8: remove vertices of degree > A from G 

9: generate all directed paths of length k in G 
10: sort the paths lexicographically, according to their endpoints 
11: group all paths u v into collection of (k — l)-sets J- uv 
12: for all pairs (J^uv^vu) do 
13: P <S— f indDisjoint(J r „„, J^u) 
14: if P = (A, B) then 
15: return C = A U B 
16: end if 
17: end for 
18: return e 

Algorithm 2: Short cycles in general graphs 

Since there are at most m/A = m 1 ^^ vertices of degree at least A, and each 
call to cycleThrough requires 0((2k — 1)! • scan(m)^j I/O's (by Lemma[5j), the 
first for loop (lines 2-7) takes C^to 1 "^ . (2k - 1)! • scan(m)) = 0((2k - 1)! • 
scan(m 2 ~i)) I/O's. 

Removing vertices of high degree in line 8 is realized just like line 5 of 
Algorithm 1 in 0(sort(m)) I/O's. There are at most mA k ~ 1 = m 2 " paths to 
be generated in line 9. It can be done in 0(k- sort(m 2 ~i)) I/O's (see Appendix). 
Sorting the paths (line 10) takes 0(sort(m 2 ~i)) I/O's. After that, creating J-" uv 's 
(line 11) requires 0(scan(m 2 ~i)) I/O's. 

The groupF procedure groups T uv and T vu together. Assume we store T^'s 
as tuples (u, v, S), for S S T uv , in a list F. By u -< v we denote that u precedes 
v in an arbitrary ordering of V. For u -< v, tuples (u, v, 1, S) from line 3 mean 
that S € T uv , while tuples (it, v, 2, S) from line 5 mean that S £ T vu . The for 
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loop (lines 1-7) clearly takes 0(scan(m 2 ~~^)) I/O's. After sorting F (line 8) in 
O (sort(m 2 ~ ^ )) I/O's, tuples for sets from F uv directly precede those for sets 
from J- vu , allowing us to execute line 9 in 0(scan(m 2 ~^ )) I/O's. 



proc groupF 


1 


for all (it, v, S) in F do 


2 


if u -< v then 


3 


write (u, v, 1, S) back to F 


4 


else 


5 


write (v, u, 2, S 1 ) back to F 


6 


end if 


7 


end for 


8 


sort F lexicographically 


9 


scan F to determine pairs (F U v i -Fuv) 



Based on Lemma |4j the total number of I/O's in calls to findDis joint in 
Algorithm [2j line 13 is 

0(E u . v (E-^(k-lY-scan(\F uv \ + \F VU \))) 

= 0((Ei=i 2 (fe ~ I) 1 ) ' £«,„ scan{\F uv \ + \F VU \)) 
= 0((2k - 1)! • scan(m 2 -i)) 

as we set p = q = k - 1 and J2t=i( k - !)* = °{( k - l) k+3 ) = 0((2k - 1)!). 

Putting it all together, we get that Algorithm [2] runs in O (sort(m 2 ~ i ) + 
(2k - 1)! • scan(m 2 -i)) total I/O's. □ 

Theorem 3. Let G = (V, E) be a directed or an undirected graph. There is an 
external- memory algorithm that, given L - a 5 -degeneracy ordering of G (for 
5 = (2 + e)d), finds a cycle of length exactly c, or concludes that it does not exist: 

(i) in o(8 l ~^ ■ (k ■ sort(m 2 ~i) + (4fe)! • scan(m 2 ~ s ))^ I/O's if c — 4fc - 2 

(ii) in o(d ■ (k ■ sort(m 2 ~i) + (4fc)! ■ scan(m 2 ~i))^ I/O's if c — 4k - 1 or 
c = 4fc 

(in) in o(6 1+ ^ ■ (k ■ sort(m 2 ~i) + (4k)\ ■ scan(m 2 ~i))^ I/O's if c = 4k + 1 

Proof. We describe the algorithm for the case of directed G, with c = 4k + 1, as 
other cases are similar (and a little easier). We assume that 8 < m 2k + 1 , which 
is obviously the case for naturally sparse graphs. Otherwise, running Algorithm 
[2] on G achieves the advertised complexity. 

Algorithm [3] is remarkably similar to Algorithm [2] and so is its analysis. 
Differences lie in the value of A and in line 9, when only some paths of length 
2k and 2k + 1 are generated. As explained in [3 , it suffices to only consider 
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1: A «- mi /5 1+ i 

2: for all v - vertex of degree > A do 
3: C «- cycleThrough(G, 4fc + 1, w) 
4: if C / e then 
5: return C 
6: end if 
7: end for 

8: remove vertices of degree > A from G 

9: generate directed paths of length 2k and 2k + 1 in G 

10: sort the paths lexicographically, according to their endpoints 

11: group all paths u ~~> v of length 2fc into collection of (2k — l)-sets J- uv 

12: group all paths u ~-> v of length 2k + 1 into collection of (2fc)-sets C/ui> 

13: for all pairs (T U v,Guv) do 

14: P f indDisjoint(J r „„, Q vu ) 

15: if P = (A, S) then 

16: return C = A U B 

17: end if 

18: end for 

19: return e 

Algorithm 3: Short cycles in degenerate graphs 



all (2k + l)-paths that start with two backward-oriented (in L) edges and all 
2fc-paths that start with a backward-oriented (in L) edge. The number of these 

paths is 0(m 2 ~i 5 1+ i ). Since we can generate them in O (k6 1+ i ■ sort(m 2 ~i)j 

I/O's (see Appendix), and there are at most 0(m}~i 5 1+ i) vertices in G of 
degree > A, the theorem follows. □ 

4 All Maximal Cliques 

The Bron-Kerbosch algorithm [8] is often the choice when one needs to list all 
maximal cliques of an undirected graph G = (V,E). It was initially improved 
by Tomita et al. |26) . We present this improvement as the BronKerboschPivot 
procedure (-T(w) denotes the set of neighbors of vertex v). 



proc BronKerboschPivot (P, R, X) 


1 


if P U X = then 


2 


output R / /maximal clique 


3 


end if 


4 


u vertex from P U X that maximizes |P (1 P (u)\ 


5 


for all v G P \ P(u) do 


6 


BronKerboschPivot (P n P(w), PU{w}, XnP(u)) 


7 


P <- P\ {«} 


8 


X <- IU {w} 


9 


end for 
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The meaning of the arguments to BronKerboschPivot: R is a (possibly non- 
maximal) clique, P and X are a division of the set of vertices that are neighbors 
of all vertices in R, s.t. vertices in P are to be considered for adding to R while 
vertices in X are restricted from the inclusion. 

Whereas Tomita et al. run the algorithm as BronKerboschPivot (V , 0,0), 
Eppstein et al. [12] improved it even further for the case of a d-degenerate G 
by utilizing its d-degeneracy ordering L — {vi, 112, . • • , v n } and by performing n 
independent calls to BronKerboschPivot. Algorithm [4] presents their version. It 
runs m time Q(dn3 d/3 ) in the RAM model. 



1: for i <— 1 . . . n do 

2: P <- r(vi) n {vj :j>i} 

3: X «- r{vi) n {«j : j < 1} 

4: BronKerboschPivot (P , ,X) 

5: end for 

Algorithm 4: Maximal cliques in degenerate graph 

The idea behind Algorithm [4] is to limit the depth of recursive calls to \P\ < d 
and then apply the analysis of Tomita et al. |26j . 

We show how to efficiently implement Algorithm [4] in the external memory 
model using a (2 + e)d-degeneracy ordering of G. Following [T2], we define 
subgraphs Hp x of G. 

Definition 2 (Graphs H PtX )- Subgraph H P<X = (V P .x,E Pt x) of G = (V, E) 
is defined as follows: 

V P}X = PUI 

E P ^x ={(u,v): (it, v) G E A (u £ P V v G P)} 

That is, ifp,jc contains all edges in G whose endpoints are from P U X, 
and at least one of them lies in P. To ensure efficiency, H P x is passed as an 
additional argument to every call to BronKerboschPivot with P and X. It is 
used in determining u at line 4 of BronKerboschPivot (we simply choose a 
vertex of highest degree in H P x)- 

The following two lemmas regarding construction of H p ,x's are proved in 
the Appendix. 

Lemma 6. Given a S-degeneracy ordering L of an undirected d-degenerate graph 
G (6 = (2 + t)d), all initial sets P, X, and graphs H P x that are passed to 
BronKerboschPivot in line 4 of Algorithm^ can be generated in 0(sort(6 2 n)) 
I/O's. 

Lemma 7. Given a S-degeneracy ordering L of an undirected d-degenerate graph 
G (5 — (2 + t)d), in a call to BronKerboschPivot that was given H p .x, with 
\P\ = p and \X\ = x, all graphs H Pnp r v \xnr(v) that have to be passed to 
recursive calls in line 6, can be formed in 0(sort(Sp 2 (p + x))) I/O's. 
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Theorem 4. Given a S-degeneracy ordering L of an undirected d-degenerate 
graph G (5 = (2 + e)d), we can list all its maximal cliques in 0(3 s ' 3 sort(Sn)) 
I/O's. 

Proof. Consider a call to BronKerboschPivot (P v , {v} , X v ) , with \P V \ = p 
and \X V \ — x. Define D(p,x) to be the maximum number of I/O's in this call. 
Based on Lemma [TJ D(p,x) satisfies the following recurrence relation: 

D(p x) < i ma - x k{k£>(p-k,x)} + 0(sort{Sp 2 (p + x))) if p > 
~~ \ e if p = 



for constant e greater than zero, which can be rewritten as 

\p + x)) 

ifp = 



D(p,x) < < max k{kD(p-k,x)} + c- Sp ( v+ x) log M/B (Sp 2 (p + x)) ifp>0 



for a constant c > 0. Since p < S and p+x < n, we have log M / B (Sp 2 (p+x)) < 
\og M / B (5 3 n) = 0(\og M / B n) for 5 = O(l). Thus, the relation for D{p,x): 

D{p, x)<{ ™MkD(p - k, x)} + Sp 2 (p + x)- c ' lo %%* " ifp>0 
) e if p = 

where c' and e are constants greater than zero. Note that this is the relation for 
D(p,x) of Eppstein et. al [12] (we set d = S, c\ = l ° D M B /B - and C2 = e). Since 
the solution for D(p,x) was D(p,x) = 0((d + x)3 p ^ 3 ), the solution for D(p,x) is 

D(p,x) = 0((5 + x)3-/» • ^MiEl) = (^3^1og M/B n) 

The total size of all sets X v passed to initial calls to BronKerboschPivot is 
0(Sn), and every set P has at most S vertices. It follows that the total number 
of I/O's in recursive calls is 

£0(^3*73! \ =0 fa/*£-hg M/B n) = 0{3 s ' 3 sort{Sn)) 

v 

Combining this with Lemma [6j we get that our external memory version of 
Algorithm Stakes 0(sort(5 2 n) + 3 s / 3 sort{5n)) = 0{3 5 / 3 sort{5n)) I/O's. □ 
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A Appendix 



A.l Proof of Lemma [2] 

The input is a d-degenerate graph G — (V, E). Let V = (1, . . . , n). We store E 
as set of edges (u,v). We assume an order on V that we utilize during sorting. 

Finding vertices of smallest degree (Algorithm [T] line 3) is realized by the 
smallVertices procedure. 



proc smallVertices 




1 


for all v - vertex of G do 




2 


d(v) <— degree of v 




3 


store pair (d(v),v) in F 




4 


end for 




5 


sort F lexicographically 




6 


S <— first ne/ (2 + e) vertices in F 


//of smallest degree 



Computing degrees of vertices in the for loop (lines 1-4) is easily realized in 
0(sort(dn)) I/O's. First, E is sorted lexicographically in 0(sort(dn)) I/O's. 
After that, edges of E form blocks ordered by their starting vertex, so a simple 
scan taking 0(scan(dn)) I/O's is enough to determine degrees of the vertices. 

Sorting F (line 5) is done in 0(sort(n)) I/O's. After that, S is just ne/(2 + e) 
first items of F and its construction (line 6) takes 0(scan(ne/(2 + e))) I/O's. 
Likewise, appending S to L (Algorithm [T] line 4) takes 0(scan(ne/(2 + e))) 
I/O's. 

Finally, removing edges adjacent to S from G (Algorithm [l] line 5) is realized 
as follows. 



1 


sort E lexicographically 


2 


for all (u, v) - edge in E do 


3 


if u € S then 


4 


add tuples (u,v, "-") and (u, it, "-") to E 


5 


end if 


6 


end for 


7 


sort E lexicographically 


8 


for all p, q - consecutive tuples in E do 


9 


if p — (u, v) and q = (u, v, "-") then 


10 


do not write p back to E 


11 


else if p = (u, v) then 


12 


write p back to E 


13 


else / ' jp= (u, v, "-") 


14 


do not write p back to E 


15 


end if 


16 


end for 



13 



Sorting E in line 1 takes 0(sort(dn)) I/O's. The first for loop (lines 2-6) takes 
0(scan(dn)) I/O's (vertices in S are stored according to the order on V, in the 
same relative order as the origins of edges in E, so it is realized by a single 
synchronized scan going through E and S at the same time). Each edge in E 
causes at most 2 tuples to be added to E in line 4, so clearly the size of E is 
0(dn) after line 6. Sorting E (line 7) obviously takes time 0(sort(dn)). 

The last for loop (lines 8-16) is easily realized by a single scan of E, in 
0(scan(dn)) I/O's. Correctness follows from two facts. First, for each edge 
(u,v), if u € S, the tuple (u, v, "-") (meaning that (u,v) does not belong to 
G after this iteration) is added to E in line 4. Second, if (u,v, "-") is in E, its 
direct predecessor is (u,v) (or another copy of (u,v, "-")) after E was sorted 
lexicographically in line 7. Therefore, the edges that are no longer in G are 
rejected in line 10. Also, no tuples (u,v, "-") are further stored in E (line 14). 

Altogether, lines 3-5 of Algorithm [T] are implemented in 0(sort(dn)) I/O's. 

□ 

A. 2 Representatives 

Monien [S3] gave a simple algorithm (which we call repQuery) that operates on 
representatives in the tree form described in Sec. [3J Given IF - a representative 
for J 7 , repQuery (J 7 , B) decides for a g-set B whether there exists a set A e T 
s.t. A n B = 0, and returns such A if it exists. The running time of repQuery is 
0(pq) in the RAM model. We use repQuery in our algorithms "as is", i.e., we 
allow it to take 0(pq) I/O's. 

Proof of Lemma [3] The size of the resulting tree is is bounded by Y2i=i "P % 
(number of nodes in a p-ary tree of height q). We can afford to build the tree one 
node at a time, spending 0(scan(p\'F\)^ I/O's on each node. Procedure repLabel 
labels vertex v in the representative tree for T in 0{scan(p\'F\)) I/O's. 



proc repLabel (J 7 , v) 


1 


A <- set A in T s.t. A n E(v) = 


2 


if A 7^ e then 


3 


label v with A 


4 


create children of v 


5 


label edges from v to its children with elements from A 


6 


else 


7 


label v with A 


8 


end if 



Set A in line 1 can simply be found by scanning T in 0(scan{p\F\)) I/O's {p\J-\ 
is the total size of all p-sets in F). Creating children of v (line 4) and labeling 
their edges (line 5) takes 0(p) I/O's. 
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Therefore, we compute a g-representative for J 7 in J3i=i P l ) ■scan{p\ .F|) j 
0((ELiP l+1 ) • *co»(l-F|)) = 0((E£V) • «*m(|.F|)) I/O's. □ 

Proof of Lemma [4] The proof is essentially the same as that of Lemma 3.2 
in [3J: first we compute a g-representative T of J 7 , in 0^(E?=i) ' scan(|J r |)^ 

I/O's, and a p-representative CJ of Q, in Of (Si=i ) ' scan {\G\)^j I/O's. 

The sizes of T and £/ are bounded by £* =1 p l an d £i=i respectively. 
Assuming p > q (w.l.o.g.), determining whether J- and contain two disjoint 
sets can be easily done in 0(£?iV • P?) = 0(E?=iV +2 ) = 0(E?=iV) 
I/O's, by querying <J (via repQuery) with all sets from J- . □ 



Proof of Lemma [5] The (very) big picture of the cycleThrough(G, k, v) 

algorithm is as follows: 



proc cycleThroughCG, k, v) 


1 


for all u s.t. (v, u) G E do 


2 


if there exists simple path p: u ~+ v of length exactly fc — 1 then 


3 


return p U (v, u) 


4 


end if 


5 


end for 


6 


return e 



Obviously, main difficulty lies in checking the condition in line [2| To show how 
we answer that query, let us first explain how 22J handles the paths. 

Let V% v denote the set of all simple paths from u to v of length exactly 
(p+ 1) (so that these paths have p inner vertices). First, all paths from u to v of 
length (p + 1) that have the exact same set of inner vertices are represented as a 
single set containing these vertices (the vertices are stored as one of the paths; it 
provides a representative of the set). Performing this compression on V? v yields 
JFP V - a family of p-sets: 

■?~uv = {5*: S" is a set of inner vertices on some path from u tov of length p +1} 

The condition from line[2]of cycleThrough is therefore equivalent to T^~ 2 being 
nonempty. We will now focus on how to test if this is the case. 

The clou of [55] was that having ^-representatives for J 7 ^ (for all u e V) 
enables efficient computation of (q — ^-representatives for J 7 ^ 1 (for all u e V). 
The labels for a (q — ^-representative tree for J-^ 1 are computed node by node. 
The algorithm is based on the following observation (7 is the node whose label 
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we compute, £'(7) is the set of edge labels on the way from 7 to root): 

3U e TP+ 1 s.t. u n £(7) = 



3w eV \ {u, v} s.t. (u,w) G E A tu ^ £(7) A 
3U g J 7 ^ s.t. E/ n (£(7) U {«}) = 

Having a ^-representative for J r ^ JV allows us to find U (or determine that it does 
not exist) via the repQuery algorithm. Determining the label for 7 is therefore 
realized as follows: 



1 


for all w s.t. (u, w) <G E and w </ E(j) do 


2 


tf<- repQuery (J£„, £(7) U {«}) 


3 


if j/ ^ e then _ 


4 


label 7 with [/ U {w} 


5 


return 


6 


end if 


7 


end for 


8 


label 7 with A 



repQuery (querying a representative tree) takes 0(pq) I/O's, so our imple- 
mentation of labeling 7 takes 0(pq ■ scan (r(u))) I/O's (where r(u) denotes 
the number of neighbors of u in G). It simply scans neighbors of u and calls 
repQuery accordingly. Because the (q — ^-representative tree for TT^ 1 has 
size bounded by J2i=i(P + 1)' — + labeling all its nodes requires 

O^p+iy- 1 ■ (pq- scan(r(u)))^J = o(q 2 (p + 1)« ■ scan(r(u))^j I/O's. We are 

computing (q— ^-representatives for J^+^s for all u's, so it takes o(^J2 u (? 2 (p+ 

l)'-scon(|r(u)|))) = o(g 2 (p+l) 9 -£ u scan(|/>)|)) = 0(g 2 (p+l)9-scan(™)) 
I/O's in total. 

Our goal is to compute O-representatives T^~ 2 for T^~ 2 (for all u G V). Then, 
by calling repQuery (J 7 ^ 2 , 0), we determine whether J 7 ^ 2 is nonempty, as it 
either returns e (if T^~ 2 is empty), or a set A € T^~ 2 , representing a path from 
u to v of length fc — 1. 

We start with (fc — 2)-representatives for J"°„'s. They are built as trees having 
only the root vertex, labeled with either (if (u, v) e E), or A (otherwise). They 
can be obviously constructed in 0(scan(m)) I/O's, via scanning E. Based on 
the discussion above, computing T^~ 2, s (for all u <^V) takes 

0(E k p Z 3 o( k - 2- P )(p+l) k - 2 - p ■ scan{m)) 
= 0((fc - 2) • scan{m) • EpJ(;P + l) fe - 2 - p ) 
= 0((k-2)- scan(m) ■ (fc - 2)!) 
= 0((k - 1)\ ■ scan{m)) 
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I/O's, as it can be easily shown by induction that X]p=o(P + 2 p < (k — 2)! 
for k > 6. 

A. 3 Path Generation 

General Graphs (Algorithm [2]) Recall that for Algorithm [2] we need to 
generate all directed paths of length fc in a graph G = (V,E), where maxi- 
mum degree of each vertex is bounded by A. As shown in [3], there are at 
most 0{mA l ) — 0(m 2 ~k) such paths. They are generated by the pathGen 
procedure. 



proc pathGen 






1 


generate all sequences of length k 


— 1, with elements from {1, . . 




2 


for all s - sequence € {1, . . . , A} k 


1 do 




3 


generate sequences e\s, for all e 


€ E 




4 


end for 






5 
6 


for all s - sequence € E X {1, . . . 
decode s into s' € E k 


A}*- 1 do 




7 

8 


if s' 7^ e then 

if s' is a simple path then 






9 


output s' 






10 


end if 






11 


end if 






12 


end for 







Line 1 is simply realized in 0(scan(A k ~ 1 )) = O^sca^m 1 ^^)) I/O's. Adding 
an edge at the beginning of each sequence in the first for loop (lines 2-4) takes 
0(m ■ scan(A k ~ 1 )) — 0(scan(m 2 ~* )) I/O's. 

Decoding a sequence s G Ex {1, . . . . Z\} fe_1 into a path s' G £ ,fe (pathGen, 
line 6) is conceptually straightforward, e is the first edge in the path. Then, 
each consecutive number i determines next vertex on the path - ith neighbor of 
the previously decoded one (if it has less than i neighbors, the path is dropped 
as invalid). The decodePaths procedure handles decoding of S - the set of 
sequences. s[i] is the zth element of tuple s, s[i] .from is the origin, and 
s [i] .to is the destination vertex of edge at s [i] . 

The first for loop (lines 1-3) decodes the starting edge of the sequence s into 
two vertices, and then cyclically shifts the resulting tuple by one position. It 
takes 0(scan(m 2 ~k^ I/O's. Each iteration of the second for loop (lines 4-12) 
decodes the next vertex of s and again cyclically shifts s by one position. Sorting 
S in line 5 takes 0[sort(m 2 ^^)) I/O's. After that, the inner loop (lines 6-11) 
requires only 0(scan(m 2 ~i)^ I/O's (it takes one synchronized scan of S and 
E). Invalid paths that do not meet the condition at line 8 are dropped. Since the 
outer for loop runs for k — 1 iterations, decodePaths uses 0{k ■ sort(m 2 ~ & )) 
I/O's. 
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proc decodePaths (S) 




1 


for all s - sequence € E X {1, . . . , Z\} fe_1 do 




2 


write tuple (s [1] . to, s [2] , s [3] , . . . , s [k] , s [1] . 


from) to S 


3 


end for 




4 


for all i <— 1, . . . , k — 1 do // 5 contains tuples 


V x {l,...,A} k - 1 x V 1 


5 


sort 5 lexicographically 




6 


for all s - tuple in S do 




7 


u s [2] th neighbor of s [1] in 1/ 




8 


if u 7^ e then 




9 


write tuple (it, s [3] , s [4] , . . . , s [k+1] , s [1] ) 


back to 5 


10 


end if 




11 


end for 




12 


end for 





Verifying that a path is simple (pathGen, line 8) is done by checking that it 
does not contain repeated vertices. Thus, pathGen takes 0{sort(rn 2 ~ nyj I/O's. 



Degenerate Graphs (Algorithm [3]) For Algorithm [3j we need to generate 
all paths of length 2k + 1 that start with two backward-oriented (in L) edges. As 
shown in g], there are at most O(mJ2^ =0 (^A^ 2 ^) = 0(2 2k mA k 5 k ) paths 
of length 2k + 1 in G. It follows from the fact that for each path p, either p or 
p R (the reverse of p) has at most k edges with opposite directions than in L. 

The procedure pathGenForward generates paths p that have at most k edges 
with opposite directions than in L. 



proc pathGenForward 




1 


generate sequences s = u\v, for all (u,v) G E 




2 


for i «— 1 . . . 2k do 




3 


for all s - sequence £ V 2 x ({"L", "E"} x {1, . . 


,A}) 1 - 1 do 


4 


generate all sequences s\("L",j), for j e {1, . . 


.,5} 


5 


if s has < k pairs of the type ("E", j) then 




6 


generate all sequences s\("E",j), for j G {1, 


...,A} 


7 


end if 




8 


end for 




9 


end for 


max{(5, Z\}}) 2/ ° do 


10 


for all s - sequence e V 2 x ({"L", "E"} x {l, . . . , 


11 


decode s into s' e E 2k+1 


12 


if s' e then 




13 


if s' is a simple path then 




14 


output s' 




15 


end if 




16 


end if 




17 


end for 
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The encoding of the sequences works as follows: it starts with two vertices, u 
and v, that represent the starting edge of the path, v is followed by 2k pairs of 
the format ("L",i) (with i G {1, . . . ,5}) or ("E",i) (with i e {1,.. -,A}). ("L",i) 
means that the next vertex is the ith neighbor of the current vertex in degeneracy 
ordering L, and ("E",i) means that the next vertex is the ith. neighbor of the 
current vertex in E. Any sequence s has at most k pairs of the type ("E", j), 
and only edges represented by them may have the opposite direction than in L. 

The number of sequences generated by the first for loop (lines 2-9) is clearly 
O(m^2 i=0 ( 2k )A l 5 2k ~' 1 ) = 0(2 2k mA k 5 k ), and the whole generation process 
takes 0(scan(2 2k mA k 6 k )) I/O's. The sequences are decoded into paths in the 
second for loop (lines 10-17) in a manner similar to decodePaths, using 0((2fc+ 
1) • sort(2 2k mA k 8 k ) I/O's. 

Paths of length 2k + 1, that have at most k edges in the opposite direction 
than in L when they are read backwards, can be generated by an analogous 
procedure pathGenBackward (using E R - reversed edges instead of E), with the 
same I/O complexity. 

Procedure pathGen2 generates all paths of length 2k + 1 that start with two 
backward-oriented (in L) edges. 



proc pathGen2 


1 


generate paths of length 2k — 1, via pathGenForward 


2 


generate paths of length 2k — 1, via pathGenBackward 


3 


for all s - generated path of length 2k — 1 do 


4 


generate all sequences i\j\s, for € {1, . . . ,S} 2 


5 


end for 


6 


for all s - sequence e {1, . . . , d} 2 x E 2k ~ x do 


7 


decode s into s' e E 2k+1 


8 


if s' 7^ e then 


9 


if s' is a simple path then 


10 


output s' 


11 


end if 


12 


end if 


13 


end for 



Paths generated in lines 1-2 are the tails of the resulting paths. The two 
numbers in sequences added to these paths in line 4 denote the edges in L that 
are to be taken to determine first two vertices on the final path, starting at the 
tail's first vertex. This assures that these edges are backward-oriented in L. 

The number of paths generated by pathGen2 is 0(6 2 ■ 2 2k ~ 2 mA k ~ 1 5 k ~ 1 ) = 
0(2 2k - 2 mA k - 1 S k+1 ) = 0(2 2k - 2 m 2 -iS 1+ ^), and its I/O complexity is 0(k ■ 
sort(2 2k - 2 m 2 -U 1+ i)). 

Since generating all paths of length 2k that begin with a backward-oriented 
(in L) edge is essentially the same as pathGen2, path generation in this case 
requires 0(k ■ sort(2 2k - 2 m 2 -i S 1+ i)) I/O's. 
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A. 4 Graph Reordering 



Assume we are given a graph G = (V,E), with V — (1,2, and d- 

degeneracy ordering L = (vi, 1)2, . . . , v n ) of V . Our goal is to reorder G according 
to L, i.e., substitute each edge (t>i,Uj) <G E with an edge 



proc reorderG(G, L) 
1: for all k <- 1,2 do 

2: for all t>i - ith vertex in L do 
3: append tuple (vi, "i") to E 
4: end for 

5: sort E lexicographically 
6: for all p - tuple in E do 
7: if p — (u, v) then 
8: q <— tuple (u, "z") // precedes p 

9: write (v, «) back to E 

10: else 

11: do not write (v, i) back to E 

12: end if 
13: end for 
14: end for 



A single iteration of the outer for loop (lines 1-14) first renames origins of edges 
in E and then reverts them (thus, after 2 iterations edges have their original 
directions). First, it adds vertices along with their positions in L to E in lines 
2-4. This takes 0(scan(n)) I/O's. Then it sorts E (line 5) in 0(sort(dn)) I/O's. 
The next for loop (lines 6-13) scans E, renames origins of edges to their positions 
in L and outputs their opposite versions. The tuples q obtained in line 8 can 
clearly be found in the same single scan (tuple (u, "i") directly precedes all edges 
(u,v) after E was sorted), so this part is done in 0(scan(dn)) I/O's. 
Therefore, reorderG runs in 0(sort(dn)) I/O's altogether. 

A. 5 Proof of Lemma [6] 

First, observe that the total size of all sets P and X passed to initial calls to 
BronKerboschPivot is 0(m) = O(Sn). To see this, note that every edge (vi,Vj) 
(with Vi preceding vj in L = (vi, i>2, ■ • • , v n )) puts Vi into initial X for R = {vj} 
and Vj into initial P for R = {vi}. Since the size of each P is at most S, the total 
number of edges in all Hp^'s is 0(5 2 n). 

Our approach is to generate and store all P's, AT's and _ffp,x's at the very 
beginning of the algorithm, and then just pass appropriate P, X, and Hp.x to 
each initial call to BronKerboschPivot. 

Procedure genPX generates all initial P's and AT's in 0(sort(Sn)) I/O's. We 
assume that for each vertex v *E V ~we know its position in L (i.e., we know i for 
v = Vi). Also, for each edge (u,v) g E, we know positions in L of its endpoints 
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(i.e., we know both i and j for (u, v) — (vi,Vj)). We can easily achieve this in 

L 

0(sort(5n)) I/O's (see Sec. A. 4 1. By u < v, we denote that u precedes v in L. 
The for loop (lines 1-5) clearly takes 0(scan(5n)) I/O's. 



proc genPX(G, L) 


1 


for all {vi, Vj) - edge in E do 


2 


L 

if Vi -< Uj then 


3 


output tuples («i, "P",Wj), and (vj, "V,Vi) to set P 


4 


end if 


5 


end for 


6 


sort P lexicographically 


7 


scan P to create sets P and X 



The meaning of (u,, "P",^) is "add to P-i,/', and of (vj, "J.",Vi): "add Vi 
to X„ . " . The above discussion explains why this information allows to generate 
all P's and X's. Therefore, after set F is sorted in 0(sort(8n)) I/O's in line 6, 
generation of P's and X's in line 7 takes 0(scan(6n)) I/O's. 

Procedure genH generates all initial Hpx's in 0(sort(S 2 n)) I/O's. Each ver- 
tex has at most S neighbors that succeed it in L, so there are 0(5 2 n) tuples added 
to E in the first for loop (lines 1-5). They can be generated in 0(scan(8 2 n)) 
I/O's, as the edges in E are sorted lexicographically. 



proc genH(G, L) 
1: for all u - vertex in V do 

L L 

2: for all v, w - neighbors of u s.t. u -< v -< w do 
3: append tuple (v,w,lu?) to E 
4: end for 
5: end for 

6: sort E lexicographically 
7: for all p - tuple in E do 
8: if p = (v, w, ?w?) then 

9: q <— tuple of form (vi,Vj) immediately preceding p in E 
10: if q = (y, w) then 

11: output tuples (H v ,u,w) and (H v ,w,u) to set H 

12: end if 

13: do not write p back to E 
14: else I jp= (v, w) 
15: write p back to E 
16: end if 
17: end for 

18: sort H lexicographically 
19: scan H to create sets H v 

20: scan edges of H v 's and mark the endpoints that belong to P„'s 
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To understand the meaning of tuples from line 3, refer to Fig. [2| 




Fig. 2. v and w are neighbors of u, and u -< v -< it), so the tuple (v,w,?u?) is output 
in genH, line 3. If (w, tu) G 25, the edge (u, w) is in 22„. 

After E is sorted in line 6 in 0(sort(S 2 n)) I/O's, the next for loop (lines 
7-17) identifies edges that belong to sets H v . In E, each edge (v,w) is followed 
by zero or more tuples of the form (v,w, ?«?). This makes it easy to determine 
q in line 9, and therefore, the loop takes 0(scan(S 2 n)) I/O's. 

As explained in Fig. [2j (v,w) followed by (v,w, lul) means that the edge 
(u, w) has to be added to H v . It is denoted by the tuples (H v , u, w) and {H v , w, u) 
output in line 11. Sorting H in line 18 takes 0(sort(S 2 n)) I/O's, and after that, 
-ffp.x's are generated in line 19 in 0(scan(6 2 n)) I/O's. The marking of endpoints 
(line 20) also takes 0(scan(8 2 n)) I/O's. 

Therefore, total complexity of generating initial P's, X's, and Hp^s is 
0(sort{5 2 n)) I/O's. □ 

A. 6 Proof of Lemma 



proc updateH(u) 


1 


for all Hp x ~ candidate do 


2 


for all e - edge in Hp x do 


3 


if e = (u, v) or e = (v, w) then 


4 


unmark v in e 


5 


end if 


6 


end for 


7 


for all e - edge in Hp x do 


8 


if at least one vertex of e is marked then 


9 


write e back to Hp x 


10 


else 


11 


do not write e back to Hp.x 


12 


end if 


13 


end for 


14 


end for 
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Recall that |P| = p and |X| = x. Our idea in computing Hp^x's is to first 
generate candidates for Hp tX 's. Candidates are defined as Hp^x's as they would 
be if there were no lines 7 and 8 in BronKerboschPivot (i.e., as if P and X 
did not change). We then update the candidates according to lines 7 and 8 of 
BronKerboschPivot. 

Generation of the candidates for Hp^s is almost the same as in genH, only 
v and w in line 2 are now taken from P, so it uses 0(sort(p 2 (p + x))) I/O's. 
Updating Hp x 's (moving v from P to X) is realized by the procedure updateH. 

Both inner for loops (lines 2-6 and 7-13) clearly take 0(scan(\Hp^x\)) I/O's. 
Since the total size of all candidates is 0(p 2 (p + x)), a single call to updateH 
takes 0(scan(p 2 (p + x))) I/O's. There are at most p such calls, so generating 
Hp t x's that are passed to recursive calls in line 6 of BronKerboschPivot takes 
0(sort(p 2 (p + x)) + p- scan(p 2 (p + x))) = 0(sort(Sp 2 (p + x))) I/O's. □ 
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